SEQUENCE LISTING



(1) GENERAL INFORMATION: 



(i) APPLICANT:

(A) NAME: DOMPE' S.p.A.

(B) STREET: Via Campo di Pile

(C) CITY: L'AQUILA

(E) COUNTRY: ITALY

(F) POSTAL CODE (ZIP): 67100



(ii) TITLE OF INVENTION: RECOMBINANT PROTEINS DERIVED FROM HGF AND MSP



(iii) NUMBER OF SEQUENCES: 4

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1725 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



ATGTGGGTGA CCAAACTCCT GCCAGCCCTG CT[illegible]GC ATGTCCTCCT GCATCTCCTC 60



CTGCTCCCCA TCGCCATCCC CTATGCAGAG GGACAAAGGA AAAGAAGAAA TACAATTCAT 120 



GAATTCAAAA AATCAGCAAA GACTACCCTA ATCAAAATAG ATCCAGCACT GAAGATAAAA 180
ACCAAAAAAG TGAATACTGC AGACCAATGT GCTAATAGAT GTACTAGGAA TAAAGGACTT 240
CCATTCACTT GCAAGGCTTT TGTTTTTGAT AAAGCAAGAA AACAATGCCT CTGGTTCCCC 300
TTCAATAGCA TGTCAAGTGG AGTGAAAAAA GAATTTGGCC ATGAATTTGA CCTCTATGAA 360
AACAAAGACT ACATTAGAAA CTGCATCATT GGTAAAGGAC GCAGCTACAA GGGAACAGTA 420
TCTATCACTA AGAGTGGCAT CAAATGTCAG CCCTGGAGTT CCATGATACC ACACGAACAC 480
AGCTATCGGG GTAAAGACCT ACAGGAAAAC TACTGTCGAA ATCCTCGAGG GGAAGAAGGG 540
GGACCCTGGT GTTTCACAAG CAATCCAGAG GTACGCTACG AAGTCTGTGA CATTCCTCAG 600
TGTTCAGAAG TTGAATGCAT GACCTGCAAT GGGGAGAGTT ATCGAGGTCT CATGGATCAT 660
ACAGAATCAG GCAAGATTTG TCAGCGCTGG GATCATCAGA CACCACACCG GCACAAATTC 720
TTGCCTGAAA GATATCCCGA CAAGGGCTTT GATGATAATT ATTGCCGCAA TCCCGATGGC 780
CAGCCGAGGC CATGGTGCTA TACTCTTGAC CCTCACACCC GCTGGGAGTA CTGTGCAATT 840
AAAACATGCG CTGACAAAGC TTCGGGCGGT GGCGGTTCTG GTGGCGGTGG CTCCGGCGGT 900
GGCGGTTCTC TAGAGGGACA AAGGAAAAGA AGAAATACAA TTCATGAATT CAAAAAATCA 960
GCAAAGACTA CCCTAATCAA AATAGATCCA GCACTGAAGA TAAAAACCAA AAAAGTGAAT 1020
ACTGCAGACC AATGTGCTAA TAGATGTACT AGGAATAAAG GACTTCCATT CACTTGCAAG 1080
GCTTTTGTTT TTGATAAAGC AAGAAAACAA TGCCTCTGGT TCCCCTTCAA TAGCATGTCA 1140
AGTGGAGTGA AAAAAGAATT TGGCCATGAA TTTGACCTCT ATGAAAACAA AGACTACATT 1200
AGAAACTGCA TCATTGGTAA AGGACGCAGC TACAAGGGAA CAGTATCTAT CACTAAGAGT 1260
GGCATCAAAT GTCAGCCCTG GAGTTCCATG ATACCACACG AACACAGCTA TCGGGGTAAA 1320
GACCTACAGG AAAACTACTG TCGAAATCCT CGAGGGGAAG AAGGGGGACC CTGGTGTTTC 1380
ACAAGCAATC CAGAGGTACG CTACGAAGTC TGTGACATTC CTCAGTGTTC AGAAGTTGAA 1440
TGCATGACCT GCAATGGGGA GAGTTATCGA GGTCTCATGG ATCATACAGA ATCAGGCAAG 1500
ATTTGTCAGC GCTGGGATCA TCAGACACCA CACCGGCACA AATTCTTGCC TGAAAGATAT 1560
CCCGACAAGG GCTTTGATGA TAATTATTGC CGCAATCCCG ATGGCCAGCC GAGGCCATGG 1620
TGCTATACTC TTGACCCTCA CACCCGCTGG GAGTACTGTG CAATTAAAAC ATGCGCTGAC 1680
AAAGCTGACG ACGACGACAA ACACCACCAC CACCACCACC ACTAG 1725

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 574 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Trp Val Thr Lys Leu Leu Pro Ala Leu Leu Leu Gln His Val Leu
1 5 10 15

Leu His Leu Leu Leu Leu Pro Ile Ala Ile Pro Tyr Ala Glu Gly Gln
20 25 30

Arg Lys Arg Arg Asn Thr Ile His Glu Phe Lys Lys Ser Ala Lys Thr
35 40 45

Thr Leu Ile Lys Ile Asp Pro Ala Leu Lys Ile Lys Thr Lys Lys Val
50 55 60

Asn Thr Ala Asp Gln Cys Ala Asn Arg Cys Thr Arg Asn Lys Gly Leu
65 70 75 80

Pro Phe Thr Cys Lys Ala Phe Val Phe Asp Lys Ala Arg Lys Gln Cys
85 90 95

Leu Trp Phe Pro Phe Asn Ser Met Ser Ser Gly Val Lys Lys Glu Phe
100 105 110

Gly His Glu Phe Asp Leu Tyr Glu Asn Lys Asp Tyr Ile Arg Asn Cys
115 120 125

Ile Ile Gly Lys Gly Arg Ser Tyr Lys Gly Thr Val Ser Ile Thr Lys
130 135 140

Ser Gly Ile Lys Cys Gln Pro Trp Ser Ser Met Ile Pro His Glu His
145 150 155 160

Ser Tyr Arg Gly Lys Asp Leu Gln Glu Asn Tyr Cys Arg Asn Pro Arg
165 170 175

Gly Glu Glu Gly Gly Pro Trp Cys Phe Thr Ser Asn Pro Glu Val Arg
180 185 190

Tyr Glu Val Cys Asp Ile Pro Gln Cys Ser Glu Val Glu Cys Met Thr
195 200 205

Cys Asn Gly Glu Ser Tyr Arg Gly Leu Met Asp His Thr Glu Ser Gly
210 215 220

Lys Ile Cys Gln Arg Trp Asp His Gln Thr Pro His Arg His Lys Phe
225 230 235 240

Leu Pro Glu Arg Tyr Pro Asp Lys Gly Phe Asp Asp Asn Tyr Cys Arg
245 250 255

Asn Pro Asp Gly Gln Pro Arg Pro Trp Cys Tyr Thr Leu Asp Pro His
260 265 270

Thr Arg Trp Glu Tyr Cys Ala Ile Lys Thr Cys Ala Asp Lys Ala Ser
275 280 285

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Leu
290 295 300

Glu Gly Gln Arg Lys Arg Arg Asn Thr Ile His Glu Phe Lys Lys Ser
305 310 315 320

Ala Lys Thr Thr Leu Ile Lys Ile Asp Pro Ala Leu Lys Ile Lys Thr
325 330 335

Lys Lys Val Asn Thr Ala Asp Gln Cys Ala Asn Arg Cys Thr Arg Asn
340 345 350

Lys Gly Leu Pro Phe Thr Cys Lys Ala Phe Val Phe Asp Lys Ala Arg
355 360 365

Lys Gln Cys Leu Trp Phe Pro Phe Asn Ser Met Ser Ser Gly Val Lys
370 375 380

Lys Glu Phe Gly His Glu Phe Asp Leu Tyr Glu Asn Lys Asp Tyr Ile
385 390 395 400

Arg Asn Cys Ile Ile Gly Lys Gly Arg Ser Tyr Lys Gly Thr Val Ser
405 410 415

Ile Thr Lys Ser Gly Ile Lys Cys Gln Pro Trp Ser Ser Met Ile Pro
420 425 430

His Glu His Ser Tyr Arg Gly Lys Asp Leu Gln Glu Asn Tyr Cys Arg
435 440 445

Asn Pro Arg Gly Glu Glu Gly Gly Pro Trp Cys Phe Thr Ser Asn Pro
450 455 460

Glu Val Arg Tyr Glu Val Cys Asp Ile Pro Gln Cys Ser Glu Val Glu
465 470 475 480

Cys Met Thr Cys Asn Gly Glu Ser Tyr Arg Gly Leu Met Asp His Thr
485 490 495

Glu Ser Gly Lys Ile Cys Gln Arg Trp Asp His Gln Thr Pro His Arg
500 505 510

His Lys Phe Leu Pro Glu Arg Tyr Pro Asp Lys Gly Phe Asp Asp Asn
515 520 525

Tyr Cys Arg Asn Pro Asp Gly Gln Pro Arg Pro Trp Cys Tyr Thr Leu
530 535 540

Asp Pro His Thr Arg Trp Glu Tyr Cys Ala Ile Lys Thr Cys Ala Asp
545 550 555 560

Lys Ala Asp Asp Asp Asp Lys His His His His His His His
565 570

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1692 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



ATGGGGTGGC TCCCACTCCT [illegible]TCTG ACTCAATGCT TAGGGGTCCC TGGGCAGCGC 60

TCGCCATTGA ATGACTTCCA AGT[illegible]CGG GGCACAGAGC TACAGCACCT GCTACATGCG 120



GTGGTGCCCG GGCCTTGGCA GGAGGATGTG GCAGATGCTG AAGAGTGTGC TGGTCGCTGT 180

GGGCCCTTAA TGGACTGCCG GGCCTTCCAC TACAACGTGA GCAGCCATGG TTGCCAACTG 240

CTGCCATGGA CTCAACACTC GCCCCACACG AGGCTGCGGC GTTCTGGGCG CTGTGACCTC 300

TTCCAGAAGA AAGACTACGT ACGGACCTGC ATCATGAACA ATGGGGTTGG GTACCGGGGC 360

ACCATGGCCA CGACCGTGGG TGGCCTGCCC TGTCAGGCTT GGAGCCACAA GTTCCCGAAT 420



GATCACAAGT ACACGCCCAC TCTCCGGAAT GGC[illegible]G AGAACTTCTG CCGTAACCCT 480



GATGGCGACC CCGGAGGTCC TTGGTGCTAC ACAACAGACC CTGCTGTGCG CTTCCAGAGC 540 

TGCGGCATCA AATCCTGCCG GGAGGCCGCG TGTGTCTGGT GCAATGGCGA GGAATACCGC 600 

GGCGCGGTAG ACCGCACGGA GTCAGGGCGC GAGTGCCAGC GCTGGGATCT TCAGCACCCG 660 

CACCAGCACC CCTTCGAGCC GGGCAAGTTC CTCGACCAAG GTCTGGACGA CAACTATTGC 720 

CGGAATCCTG ACGGCTCCGA GCGGCCATGG TGCTACACTA CGGATCCGCA GATCGAGCGA 780

GAGTTCTGTG ACCTCCCCCG CTGCGGGTCC GAGGCACAGC CCCGCCTCGA GGGCGGTGGC 840

GGTTCTGGTG GCGGTGGCTC CGGCGGTGGC GGTTCTCTAG AGGGACAAAG GAAAAGAAGA 900

AATACAATTC ATGAATTCAA AAAATCAGCA AAGACTACCC TAATCAAAAT AGATCCAGCA 960
CTGAAGATAA AAACCAAAAA AGTGAATACT GCAGACCAAT GTGCTAATAG ATGTACTAGG 1020
AATAAAGGAC TTCCATTCAC TTGCAAGGCT TTTGTTTTTG ATAAAGCAAG AAAACAATGC 1080
CTCTGGTTCC CCTTCAATAG CATGTCAAGT GGAGTGAAAA AAGAATTTGG CCATGAATTT 1140
GACCTCTATG AAAACAAAGA CTACATTAGA AACTGCATCA TTGGTAAAGG ACGCAGCTAC 1200
AAGGGAACAG TATCTATCAC TAAGAGTGGC ATCAAATGTC AGCCCTGGAG TTCCATGATA 1260
CCACACGAAC ACAGCTATCG GGGTAAAGAC CTACAGGAAA ACTACTGTCG AAATCCTCGA 1320
GGGGAAGAAG GGGGACCCTG GTGTTTCACA AGCAATCCAG AGGTACGCTA CGAAGTCTGT 1380
GACATTCCTC AGTGTTCAGA AGTTGAATGC ATGACCTGCA ATGGGGAGAG TTATCGAGGT 1440
CTCATGGATC ATACAGAATC AGGCAAGATT TGTCAGCGCT GGGATCATCA GACACCACAC 1500
CGGCACAAAT TCTTGCCTGA AAGATATCCC GACAAGGGCT TTGATGATAA TTATTGCCGC 1560
AATCCCGATG GCCAGCCGAG GCCATGGTGC TATACTCTTG ACCCTCACAC CCGCTGGGAG 1620
TACTGTGCAA TTAAAACATG CGCTGACAAA GCTGACGACG ACGACAAACA CCACCACCAC 1680
CACCACCACT AG 1692



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 563 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Gly Trp Leu Pro Leu Leu Leu Leu Leu Thr Gln Cys Leu Gly Val
1 5 10 15

Pro Gly Gln Arg Ser Pro Leu Asn Asp Phe Gln Val Leu Arg Gly Thr
20 25 30

Glu Leu Gln His Leu Leu His Ala Val Val Pro Gly Pro Trp Gln Glu
35 40 45

Asp Val Ala Asp Ala Glu Glu Cys Ala Gly Arg Cys Gly Pro Leu Met
50 55 60

Asp Cys Arg Ala Phe His Tyr Asn Val Ser Ser His Gly Cys Gln Leu
65 70 75 80

Leu Pro Trp Thr Gln His Ser Pro His Thr Arg Leu Arg Arg Ser Gly
85 90 95

Arg Cys Asp Leu Phe Gln Lys Lys Asp Tyr Val Arg Thr Cys Ile Met
100 105 110

Asn Asn Gly Val Gly Tyr Arg Gly Thr Met Ala Thr Thr Val Gly Gly
115 120 125

Leu Pro Cys Gln Ala Trp Ser His Lys Phe Pro Asn Asp His Lys Tyr
130 135 140

Thr Pro Thr Leu Arg Asn Gly Leu Glu Glu Asn Phe Cys Arg Asn Pro
145 150 155 160

Asp Gly Asp Pro Gly Gly Pro Trp Cys Tyr Thr Thr Asp Pro Ala Val
165 170 175

Arg Phe Gln Ser Cys Gly Ile Lys Ser Cys Arg Glu Ala Ala Cys Val
180 185 190

Trp Cys Asn Gly Glu Glu Tyr Arg Gly Ala Val Asp Arg Thr Glu Ser
195 200 205

Gly Arg Glu Cys Gln Arg Trp Asp Leu Gln His Pro His Gln His Pro
210 215 220

Phe Glu Pro Gly Lys Phe Leu Asp Gln Gly Leu Asp Asp Asn Tyr Cys
225 230 235 240

Arg Asn Pro Asp Gly Ser Glu Arg Pro Trp Cys Tyr Thr Thr Asp Pro
245 250 255

Gln Ile Glu Arg Glu Phe Cys Asp Leu Pro Arg Cys Gly Ser Glu Ala
260 265 270

Gln Pro Arg Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
275 280 285

Gly Gly Gly Ser Leu Glu Gly Gln Arg Lys Arg Arg Asn Thr Ile His
290 295 300

Glu Phe Lys Lys Ser Ala Lys Thr Thr Leu Ile Lys Ile Asp Pro Ala
305 310 315 320

Leu Lys Ile Lys Thr Lys Lys Val Asn Thr Ala Asp Gln Cys Ala Asn
325 330 335

Arg Cys Thr Arg Asn Lys Gly Leu Pro Phe Thr Cys Lys Ala Phe Val
340 345 350

Phe Asp Lys Ala Arg Lys Gln Cys Leu Trp Phe Pro Phe Asn Ser Met
355 360 365

Ser Ser Gly Val Lys Lys Glu Phe Gly His Glu Phe Asp Leu Tyr Glu
370 375 380

Asn Lys Asp Tyr Ile Arg Asn Cys Ile Ile Gly Lys Gly Arg Ser Tyr
385 390 395 400

Lys Gly Thr Val Ser Ile Thr Lys Ser Gly Ile Lys Cys Gln Pro Trp
405 410 415

Ser Ser Met Ile Pro His Glu His Ser Tyr Arg Gly Lys Asp Leu Gln
420 425 430

Glu Asn Tyr Cys Arg Asn Pro Arg Gly Glu Glu Gly Gly Pro Trp Cys
435 440 445

Phe Thr Ser Asn Pro Glu Val Arg Tyr Glu Val Cys Asp Ile Pro Gln
450 455 460

Cys Ser Glu Val Glu Cys Met Thr Cys Asn Gly Glu Ser Tyr Arg Gly
465 470 475 480

Leu Met Asp His Thr Glu Ser Gly Lys Ile Cys Gln Arg Trp Asp His
485 490 495

Gln Thr Pro His Arg His Lys Phe Leu Pro Glu Arg Tyr Pro Asp Lys
500 505 510

Gly Phe Asp Asp Asn Tyr Cys Arg Asn Pro Asp Gly Gln Pro Arg Pro
515 520 525

Trp Cys Tyr Thr Leu Asp Pro His Thr Arg Trp Glu Tyr Cys Ala Ile
530 535 540

Lys Thr Cys Ala Asp Lys Ala Asp Asp Asp Asp Lys His His His His
545 550 555 560

His His His



